Skip to content

Add puzzletron eval skill#1802

Open
danielkorzekwa wants to merge 3 commits into
dkorzekwa/claude_qwen35from
dkorzekwa/claude_qwen35_eval
Open

Add puzzletron eval skill#1802
danielkorzekwa wants to merge 3 commits into
dkorzekwa/claude_qwen35from
dkorzekwa/claude_qwen35_eval

Conversation

@danielkorzekwa

Copy link
Copy Markdown
Contributor

What does this PR do?

NOTE: First merge Create adding_new_model_tutorial.md by danielkorzekwa · Pull Request #1784 · NVIDIA/Model-Optimizer into main and then main into this branch.

Add a skill to evaluate a puzzletron-compressed model for mmlu

Usage

See from step 7 in the add new puzzletron model tutorial

Testing

Tested manually. Please go over the tutorial before approving this MR.

Before your PR is "Ready for review"

  • Is this change backward compatible?: ✅
  • Did you write any new necessary tests?: N/A tested manually
  • Did you update Changelog?: ✅

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
@coderabbitai

coderabbitai Bot commented Jun 23, 2026

Copy link
Copy Markdown
Contributor

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

🗂️ Base branches to auto review (3)
  • main
  • release/.*
  • feature/.*

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 4df32053-6b89-4b9e-b8d4-7c4b7bc8397b

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch dkorzekwa/claude_qwen35_eval

Comment @coderabbitai help to get the list of available commands.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants